Method of correlating power in a data center by fitting a function to a plurality of pairs of actual power draw values and estimated power draw values determined from monitored CPU utilization of a statistical sample of computers in the data center

ABSTRACT

The subject matter of this specification can be embodied in, among other things, a method that includes supplying power to a portion of a data center through a power distribution line. Utilization of a statistically significant sample of the computers is monitored, and an estimated individual power draw for each of the computers based on the utilization is calculated. An estimated total power draw is calculated for different times from the estimated individual power draws to generate a collection of estimated total power draw values for the different times. Actual power draw is monitored at the power distribution line and a collection of actual power draw values is generated. A function is fitted to pairs of actual power draw values and estimated power draw values, each pair comprising an actual draw value and an estimated draw value for the same time, and the function is then stored.

TECHNICAL FIELD

Various implementations relate generally to electrical power distribution.

BACKGROUND

In general, power distribution systems receive high voltage and/or high current electrical power from a utility provider, generator station, or other source of power. The power distribution systems may transform the received power to electrically powered equipment, such as the computers and cooling equipment in a data center. Electrical power is generally conducted by high current conductors that may be split into two or more branch conductors to divide and distribute electrical power. Some of these branches may be split to further divide and distribute electrical power. Each of the electrical conductors may be protected by circuit breakers, and/or other over-current protection devices to stop the flow of electrical currents in excess of the conductors' ratings.

SUMMARY

In general, this document describes systems, apparatus, and methods relate to current measurement devices that compare a load's estimated power draw to a measured power draw, and uses that feedback to improve the accuracy of subsequent power draw estimates. In an exemplary implementation, a power monitoring module (PPM) may estimate the power draw of a computer or group of computers based on a computing load assigned to the computer(s), measure the computer's actual power draw, and then use the measurement to determine a more accurate model for predicting the power draw of the computer(s) while operating at various computing loads. A more accurate model can permit a facility to operate closer to the maximum capacity of the power distribution system, with limited risk of exceeding the maximum capacity.

In a first aspect, a method of correlating power in a data center comprises supplying power to a portion of a data center through a power distribution line, the portion of the data center including a plurality of computers that draw power through a circuit breaker. Utilization of at least a statistically significant sample of the plurality of computers is monitored, and an estimated individual power draw for each of the sample of the plurality of computers based on the utilization is calculated. An estimated total power draw is calculated for each of a plurality of different times from the estimated individual power draw of each of the sample of the plurality of computers to generate a plurality of estimated total power draw values for the plurality of different times. Actual power draw is monitored at the power distribution line by the portion of the data center and a plurality of actual power draw values is generated for the plurality of different times. A function is fitted to a plurality of pairs of actual power draw values and estimated power draw values, each pair of the plurality of pairs comprising an actual draw value and an estimated draw value for the same time, and the function is then stored.

Various implementations can include some, all, or none of the following features. The utilization can be CPU utilization. Calculating the estimated individual power draw can include storing data representing a computer configuration for each of the plurality of computers, and the computer configuration can be an input to a function relating utilization to estimated individual power. The computer configuration can include one or more of processor speed, amount of memory or number of disk drives. Calculating the estimated total power draw can include summing the estimated individual power draw of each of the sample of the plurality of computers. The sample can include substantially all of the plurality of computers. The function can be a linear function. The method can also include receiving a maximum power draw value for the portion of the data center, calculating an estimated current individual power draw value for each of the sample of the plurality of computers based on the utilization, calculating an estimated current total power draw value from the estimated current individual power draw value of each of the sample of the plurality of computers, at least one of calculating an adjusted maximum power draw value from the maximum power draw value and the function, and comparing the estimated current total power draw value to the adjusted maximum power draw value, or calculating an adjusted estimated current total power draw value from the estimated current total power draw value and the function, and comparing the adjusted estimated total power draw value to the maximum power draw value, and adjusting operation of at least one computer in the plurality of computers based on the comparison. Adjusting operation can include one or more of adjusting job allocation, adjusting job scheduling or adjusting central processing unit execution frequency. Calculating an estimated individual power draw for a computer of the plurality of computers can include measuring utilization of the computer at a plurality of utilization measurement times to generate a plurality of utilization values for the computer, there being a greater number of utilization measurement times than different times such that there are a greater number of utilization values for the computer than different times. The method can further include associating each of the plurality of utilization values with one of the plurality of different times. Associating can include comparing a utilization measurement time for a utilization value with a midpoint between two adjacent different times. There can be a plurality of utilization value times for each of plurality of different times. Calculating the estimated total power draw for one of the different times can include weighting each one in a collection of estimated values based on a difference between a value estimating time for the estimate value and the different time. The method can further include synchronizing time stamps of the different times and the utilization measurement times. Calculating an estimated total power draw can occur at a higher frequency than monitoring actual power draw. The adjusting operation may include adjusting a multiplicity of computers in the plurality of computers to cause the estimated current total power draw value to be closer to the adjusted maximum power draw value. The adjusting operation may include adjusting a multiplicity of computers in the plurality of computers to cause the adjusted estimated total power draw value to be closer to the maximum power draw value.

The systems and techniques described here may provide one or more of the following advantages. First, a system can provide a more accurate estimate of the power to be used in a facility such as a data center. Second, a system can operate closer to the maxim capacity of the power distribution system. Third, the risk of exceeding the maximum capacity can be maintained at an acceptable level. Fourth, the system can help capture deployment or accounting errors if failed or highly skewed correlation is observed.

The details of one or more implementations are set forth in the accompanying drawings and the description below. Other features and advantages will be apparent from the description and drawings, and from the claims.

DESCRIPTION OF DRAWINGS

FIG. 1 shows an example power distribution system for the dynamic estimation of power consumption of power nodes.

FIG. 2 is a block diagram that shows an example of a system for estimating the power draw of an electrical load.

FIG. 3 depicts an example peer-to-peer interaction among intelligent protection modules for the dynamic estimation of power draw of power nodes.

FIG. 4 is a graph of example estimated power consumption and example measured power consumption.

FIG. 5 is another graph of example estimated power consumption and example measured power consumption.

FIGS. 6-8 are flow diagrams of an example processes for estimating the power consumption of an electrical load.

FIG. 9 is a schematic diagram of an example of a generic computer system.

Like reference symbols in the various drawings indicate like elements.

DETAILED DESCRIPTION

This document describes systems and techniques for accurately estimating the power to be used in a facility such as a data center. Electrical devices are generally rated for a maximum current draw, and in some instances these ratings can be somewhat conservative. In addition, the electrical devices may only occasionally, if ever, draw their rated currents. In some instances, power distribution systems can be conservatively built to supply the devices' rated currents. The collective power of the devices connected to branches of the power distribution system may remain conservatively below the breaker limit for their respective branch, and the attached devices may not be drawing their maximum amount of power simultaneously. Overall, a power distribution system may leave some portion of the available power unused, and the amount of unused power may increase as the number of power branches increases.

Hypothetically, the power draw of a computer can be estimated from the utilization of the computer. However, the algorithm that computes power draw from utilization may not be accurate. In addition, even if the computation of the power draw from a single computer is accurate, summation of the power draw of multiple computers may not accurately represent power drawn from a branch line, e.g., due to the presence of other devices on the branch line such as routers, lighting, and cooling systems. However, a load's estimated power draw can be compared to a measured power draw, and that feedback can be used to improve the accuracy of subsequent power draw estimates.

FIG. 1 shows an example power distribution system 100 that includes dynamic estimation of power consumed by nodes in the system 100. The estimations for each node are configured to achieve improved utilization of electrical distribution infrastructure by allowing a network of current protection devices to intelligently estimate power draws of electrical loads in response to dynamic load conditions. For example, in the example depicted in FIG. 1, the power distribution system 100 includes a number of such power monitors, or power monitoring modules (PMMs), that estimate and monitor the amount of power needed by various load circuits under variable load conditions, while protecting the source node against overloads that would exceed the source node's predetermined capacity limit. For example, when current through a PMM approaches full (e.g., 100%, greater than 90%, greater than 85%) utilization of its present capacity estimation, the PMM may generate a request message asking child nodes to limit or reduce their present power consumption. In some implementations, the request message may include a workload limit for the child node. For example, the child node may be an electrical load such as a computer with a controllable clock speed, in which the speed of the computer's clock has a corresponding effect on the amount of power consumed by the computer. In another example, the child node may be an electrical load such as a battery charger or ventilation fan, in which the charge rate or fan speed, respectively, have corresponding effects on the amount of power drawn by the child node. Accordingly, a number of PMMs can operate together to automatically negotiate a capacity sharing arrangement that adapts to the dynamic load conditions to achieve improved utilization of infrastructure power handling capability to distribute power to meet peak power demands at different loads.

The power distribution system 100 includes a facility 102 that receives high voltage and/or current power from an electrical utility provider 104. The facility 102 can include a power substation 106. The power substation 106 transforms the high voltage and/or current power into usable voltages and/or currents for electrical loads in the facility 102, and distributes the transformed power through a power monitoring module (PMM) 107, and on to branch conductor 108 a and a branch conductor 108 b.

The branch conductor 108 a includes a PMM 110 a, and the branch conductor 108 b includes a PMM 110 b. The PMM 110 a provides power monitoring for a circuit 112 that supplies power to an electrical load 114. The PMM 110 b provides power monitoring for a branch conductor 116 a and a branch conductor 116 b. The branch conductors 116 a and 116 b include a PMM 118 a and a PMM 118 b, respectively. Although only two branches are shown at each level of the hierarchy of the power distribution system 100, this is merely illustrative, and any given node could have three or more branches, and there could be additional levels to the hierarchy.

The PMMs 118 a and 118 b provide power monitoring for an electrical load 120 a and an electrical load 120 b, respectively. If the facility is a data center, then the electrical loads 114, 120 a, 120 b include at least some server computers 115, 121 a, 121 b, respectively, such as one or more racks of server computers. In some implementations, the electrical loads 114, 120 a, and 120 b can include computers, collections of computers, racks of server computers, collections of racks of server computers, networking equipment, environmental control equipment, lighting systems, or combinations of these and other appropriate forms of electrical loads.

The PMMs 107, 110 a, 110 b, 118 a, and 118 b, are able to estimate and measure the amount of power consumed by their respective child nodes, and can communicate with each other to allocate power from a shared supply. In some implementations, the PMMs 107, 110 a, 110 b, 118 a, and 118 b can communicate in a peer-to-peer network. For example, the PMM 110 a may send a message to the PMM 107 to request that the PMM 110 b reduce the amount of power being consumed by its child nodes. If the request is granted, the PMM 110 a may then raise the amount of power available to its own child nodes by a substantially like amount. In some implementations, the PMMs 107, 110 a, 110 b, 118 a, and 118 b can communicate with an information management system 122 which includes a database 124. For example, the PMMs may communicate with the information management system 122 to request and/or receive power estimation settings, or to send and/or receive statuses, alarms, notifications, configurations, or other data that may be used by the PMMs 107, 110 a, 110 b, 118 a, and 118 b. In some implementations, the information management system 122 can access the database 124 to store and retrieve information relating to the PMMs 107, 110 a, 110 b, 118 a, and 118 b.

The server computers 115, 121 a, and 121 b, serve information to a number of computers 126 via a wide area network (WAN) 128. In some implementations, the computers 126 can place varying computing loads upon the server computers 115, 121 a, and 121 b. For example, the server computer 115 may host email services, and the server computers 121 a and 121 b may host video sharing services. Demand for these two different services can vary as the amount of traffic from the computers 126 varies. For example, demand for email services may increase in the daytime as users of the computers 126 access their email for work, but in the evening the demand for email services may decrease while the demand for video sharing services increases as people browse videos during their free time.

As the computing loads vary, electrical current needs of the electrical loads 114, 120 a, and 120 b can vary as well. For example, during the day the computing load placed upon the server computers 115 may cause the electrical load 114 to draw 60 A of electrical power in order to operate, while the electrical loads 120 a and 120 b draw 20 A. At night, the server computers 115 may experience lower computing load, and therefore the electrical load 114 may draw 40 A while the server computers 121 a and 121 b may experience increased computing loads such that the electrical loads 120 a and 120 b experience a 70 A draw.

The branch conductors 108 a and 108 b share a supply, illustrated as “Z” amperes. The power monitoring module 110 a is configured to allow a number of amperes of current, designated as “X”, to pass onto the conductor 112, and the power monitoring module 110 b is configured to allow the remaining number of amperes of current, designated by the value “Z-X”, to pass along the conductors 116 a and 116 b. As the electrical current demand of the electrical load 114 varies, the value of “X” can vary as well. For example, Z may be 100 A, and the load connected to the PMM 110 a may draw approximately 75 A, leaving approximately 25 A of capacity available for use by the loads connected to PMM 110 b without exceeding the 100 A supply.

The PMM 110 a may limit the amount of power it passes by estimating the amount of power the electrical load 114 consumes at a given utilization of the server computers 115. The PMM 110 a may determine that the electrical load 114 will consume approximately “X” amperes when the server computers 115 are utilizing N % of their computing capacity. The PMM may request the server computers 115 in the electrical load 114 to limit their combined computing load to N % of capacity, thereby limiting the combined power consumption of the electrical load 114 to approximately “X” amperes.

The estimation, however, may not be completely accurate. For example, the electrical load 114 may actually draw 0.8*X amperes when the server computers 115 are at N % of computing capacity, thus underutilizing the amount of power available to the electrical load 114 (e.g., the server computers 115 could operate at more than N % utilization when the electrical load 114 is using “X” amperes), or the electrical load 114 may actually draw 1.3*X amperes when the server computers 115 are operating at N % utilization, thus consuming more than the “X” amperes allotted to the PMM 110 a (e.g., and trip an “X” ampere circuit breaker, and over-stress the branch conductor 108 a or the substation 106).

The PMMs 107, 110 a, 110 b, 118 a, and 118 b estimate the amounts of power their respective child nodes will draw for their respective utilizations, and compare those estimates to respective measured amounts of power that are drawn by their respective child nodes as they operate at those configured utilizations. The PMMs 107, 110 a, 110 b, 118 a, and 118 b compare the estimated and measured power amounts to determine correlations between utilization levels and actual power consumption. The PMMs 107, 110 a, 110 b, 118 a, and 118 b may then use the determined correlations when performing subsequent estimations of the power draws for various corresponding utilizations.

In some implementations, by intelligently allocating capacity among the PMMs 107, 110 a, 110 b, 118 a, and/or 118 b, electrical power utilization can increase without increasing the electrical supply capacity. For example, PMMs 107, 110 a and 110 b can be initially allocated 50 A each while the electrical load 114 is drawing 20 A, the electrical loads 120 a and 120 b are drawing 20 A each, and the PMMs 118 a and 118 b may be allocated 25 A apiece. The PMM 110 a has approximately 30 A of excess capacity (50 A−20 A=30 A), while the PMM 110 b may have 10 A (50 A−(2×20 A)=10 A). As computing demands change, the electrical load 120 a may need to be reconfigured to draw an estimated 40 A, exceeding the allocation given to the PMM 118 a. In some implementations, the PMM 118 a can request the PMM 118 b to limit its consumption by 5 A. If granted, the PMM 118 b can reduce its estimated power consumption to 20 A and the PMM 118 a can increase its own estimated power consumption to 30 A.

In this example, the PMMs 118 a and 118 b have substantially maximized their use of the 50 A allocated to the PMM 110 b. However, there remains a 10 A shortage along the branch conductor 116 a. In some implementations, the PMM 110 a can request an additional power allocation from the upstream PMM 110 b. For example, the PMM 118 a can request an additional 10 A allocation from the PMM 110 b. However, in this example, the PMM 110 b is already configured to consume its allocated 50 A. In some implementations, the PMM 110 b can send a message to the PMM 110 a to determine if the PMM 110 a has any unused capacity that could be re-allocated to the PMM 110 b.

For example, the PMM 110 b may request a 10 A allocation from the PMM 110 a. Since the PMM 110 a has 30 A of excess capacity, the PMM 110 a may lower its own estimated power usage by 10 A and grant the freed capacity to the PMM 110 b. The PMM 110 b can then raise its estimated power use by 10 A to a total of 60 A, thereby satisfying the power needs of the electrical loads 120 a and 120 b, and increase the utilization of the 100 A available from the substation 106. Additional examples of intelligent power estimation are discussed in further detail in relation to FIGS. 2-8.

FIG. 2 is a block diagram that shows an example of a system 200 for estimating the power draw of an electrical load 202. The system 200 includes a power monitoring module (PMM) 204. In some implementations, the PMM 204 can be the PMM 107, 110 a, 110 b, 118 a, or 118 b of FIG. 1.

The PMM 204 includes a quota allocation module 206. The quota allocation module 206 is communicatively connected to a parent control system 208. The quota allocation module 206 communicates with the parent control system 208 to receive power allocation quotas. For example, the parent control system 208 may determine that the PMM 204 is to be granted a 50 A allocation of power and communicate information about that allocation to the quota allocation module 206. In reference to FIG. 1, the PMM 118 a may receive a power allocation of “Y” amperes from its parent PMM 110 b.

Referring back to FIG. 2, the quota allocation module 206 is communicatively connected to a measurement estimation correlation module 210. The measurement estimate correlation module 210 is also communicatively connected to a current measurement module 220. The current measurement module 220 measures the amount of power flowing from a power source 222 to the electrical load 202 through a circuit protection module 224 (e.g., a circuit breaker). The measurement estimate correlation module 210 communicates with the current measurement module 220 to receive information descriptive of the amount of power being consumed by the electrical load 202.

The measurement estimate correlation module 210 receives a nominal limit value 230 from the quota allocation module 206, and determines an adjusted limit value 232 that is provided back to the quota allocation module 206. The quota allocation module 206 then provides the adjusted limit value to a child control system 240 associated with the electrical load 202. In some implementations, the adjusted limit value may also be provided to the parent control system 208. For example, the parent control system 208 may use the adjusted limit value to improve the accuracy of power allotments. The adjusted limit value is a value that the measurement estimate correlation module 210 estimates the child control system 240 will consume when operating in accordance with the adjusted limit value 232. In some implementations, the child control system 240 may be an electrical load, such as a computer or collection of computers, e.g., the server computers 115. For example, the adjusted limit value 232 may be a utilization value that may limit or otherwise control the computing throughput of the server computers 121 a to control the amount of power consumed by the electrical load 120 a. The PMM 118 a may transmit the adjusted limit value 232 to the server computers 121 a to cause the electrical load 120 a to consume an estimated “Y” amperes.

In some implementations, the child control system 240 may be another PMM. For example, the adjusted limit value 232 may be an amperage limit that the downstream PMM is being requested to honor.

The current measurement module 220 measures the actual amount of power being consumed by the electrical load 202. As such, the measurement estimate correlation module 210 compares the nominal limit value 230, the adjusted limit value 232, and the actual power consumption measurement provided by the current measurement module 220 to modify the estimated correlation between the nominal limit value 230 and adjusted limits to determine a new value for the adjusted limit value 232. The new value for the adjusted limit value 232 is then provided by the quota allocation module 206 to the child control system 240. In some implementations, by using the measured power consumptions of the electrical load at various adjusted limits, the PMM 204 may determine improved adjusted limits that cause the electrical load 202 to consume power at levels that better adhere to respective nominal limits.

FIG. 3 depicts an example peer-to-peer interaction 300 among a hierarchy of power monitoring modules (PMMs) 302 a-302 f for the dynamic estimation of power draw of power nodes. In some implementations, the PMMs 302 a-302 f can be the PMM 204 of FIG. 2, or the PMMs 107, 110 a, 110 b, 118 a, or 118 b o FIG. 1.

When deciding how to set the utilization “U” of the computers in order to achieve a target power “P”, in some implementations one technique would be to use some function U=f(P). In some implementations, each computer receives the target power P from a PMM, and calculates its utilization from U=f(P) using the function f for that computer. The function f can be different for different computers, e.g., the function f can depend on the components in the computer. In some implementations, function f can be described in a lookup table, or by a collection of mathematical formulae in which each formula estimates power consumption for a corresponding utilization subrange.

The problem is that, in some implementations, the function f(P) may not be accurate due to limited measurement samples and workload varieties. In addition, the quota P given to a computer may not accurately represent the real available capacity due to the presence of other devices on the branch line such as routers, lighting, and cooling systems. For example, as previously noted, the function f(P) may not accurately represent power drawn from a branch line, e.g., due to the presence of other devices on the branch line such as routers, lighting, and cooling systems.

To reduce this inaccuracy, actual capacity, often defined by the circuit breaker's hard limit (HL0) of the branch, can be converted to an adjusted capacity (HL1) which is distributable to computers. In some implementations, a dynamic conversion of actual capacity to adjusted capacity can be performed in a three step process. An estimated power draw is calculated from utilization values. A functional relationship between the estimated power draw and the measured power draw is determined. The adjusted capacity is calculated based on the determined function using the actual capacity as the input.

In some implementations, the actual capacity can be converted to adjusted capacity using a function HL0=g(HL1), where g(HL1)=a*HL1+b. The function g can also be considered as a function that converts an estimated power draw E to a corrected estimated power draw M that is in the same parameter space as the measured power draw, e.g., M=g(E)=a*E+b. Parameter “a” generally compensates for proportional error between measured power draw and respective utilization-derived estimated power draw. Parameter “b” generally represents the constant power consumed by equipment but not included in a given aggregation path (e.g. networking equipment, lighting). The parameters “a” and “b” can be periodically updated to adjust for variations in the electrical loads 114, 120 a, 120 b.

In some implementations, parameters “a” and “b” can be determined from aggregated power estimates from the server computers and power drawn by the loads. The power drawn by the load is measured by the PMM at various different times, to create a plurality of measured power values “M”. Similarly, the estimated power drawn by the server computers is collected at various different times to create a plurality of measured utilization values “E”. For example, each of a statistically significant number of the server computers, e.g., all of the server computers, can calculate an estimated power Q drawn by that computer based on the utilization V at that computer, e.g., using the function f with V=f(Q). Each of these computers can report their estimated power draw Q to the PMM, and the PMM aggregates, e.g., sums, the estimated power Q from the individual server computers to generate an estimated power draw E′ for the server computers in the load.

At least one estimated power value “E” is associated with each of at least some of the plurality of measured power values “M”, thereby generating a sequence of pairs of measured power values “M” and estimated power values “E”. For example, a collection of servers may each estimate their own individual power values on a substantially periodic interval (e.g., every 0.1, 1, 5, 10, 30, 60 seconds) and provide those estimated power values V to the PMM from which the servers draw their power. The PMM may then combine those individual estimated values to determine the estimated power value “E”, and compare that estimate to power values M′ that are measured by the PMM relatively less frequently (e.g., every 0.25, 0.5, 1, 2, 10 minutes). Due to potential higher sampling rate of the estimated power values E′, multiple values of E′ can be combined, e.g., averaged, to provide a single estimated power value E′ for a given measured power value M′.

The values “a” and “b” can be calculated by fitting a function, e.g., a linear function, to the pairs of measured power values “M” and estimated power values “E”. In essence, the pairs of values (M′, E′) provide a scatter plot to which the function is fit.

The fitting can generate a function g(x)=a*x+b. When used as a function to convert an estimated power draw E to a corrected estimated power draw M in the parameter space of the measured power draw, the functional relationship between the estimated power draw “E” and the corrected estimated power draw “M” can be a linear function, e.g., using the slope-intercept equation: M=aE+b

Similarly, when considered as a function to convert to actual capacity HL0 to adjusted capacity HL1, the functional relationship can be a linear function: HL0=a*HL1+b

In either case, parameter “a” compensates for proportional error, and parameter “b” represents the constant power consumed by equipment plugged into the power hierarchy but not included in a given aggregation path, such as networking equipment, lighting, and/or other appropriate electrical loads that are substantially constant contributors to a measured aggregate electrical load.

At levels where there is power measuring equipment such as a current measurement module 310, a collection of aggregated model-based estimates (E′) 322 is compared with a collection of actual readings (M′) 324 to determine values for the correlation value “b” and the proportional error value “a”. In the illustrated example, the PMM 302 c has gathered a collection of samples 320 during a recent time window by both aggregating updates from its child nodes (e.g., PMMs 302 d-302 f), and by communicating with the current measurement module 310.

In the illustrated example, fitting of a linear function 330 to the data generates an equation M=1.1 E+0.1, i.e., the proportional error value “a”=“1.1” and the correlation value “b”=“0.1”. As such, when the PMM 302 c determines the adjusted actual through which the 1.2 actual limit is converted to a 1.0 actual hard limit because the equation 330 projects that an estimated usage of 1.0 can lead to 1.2 actual measured power consumption. In use, the current measurement module 310 may not be able to provide readings as fine-grained or time-synchronized as the model-based estimates. In some implementations, multiple estimates may be aggregated to correlate one or more measurement readings. In some implementations, the fitting may be determined such that the model substantially compensates for any time offset between the 2 time series. A graphical example of a linear model of estimated values fitted to a collection of measured values is discussed in the description of FIG. 4.

FIG. 4 is a graph 400 of example estimated power consumption and example measured power consumption. The graph 400 includes a line 410 which represents the ideal case, where measured values “M” are equal to estimated values “E”. The graph 400 also includes a collection of measured values 420. The collection of measured values 420 a-420 f are measured values “M” for power consumption at selected estimated values “E” for power consumption. For example, for a predetermined level of utilization of the server computers 115, the estimated value “E” of the electrical load's 114 power consumption may be “1.0”, however, the actual measured value “M” 420 c may be “1.195”.

A function, e.g., a linear function, represented by a line 430, is fit to the collection of measured values 420 a-420 c. A higher-order polynomial function can be used if a linear function does not fit sufficiently well. In some implementations, the determined solution represented by the line 430 may be used by the PMMs 110 b and 302 c to estimate the allocatable power capacity for the given actual limit. In some implementations, the determined solution represented by the line 430 may be used by the PMMs 107, 110 a, 110 b, 118 a, 118 b, 204, and 302 a-302 f to estimate the actual power consumption of various electrical loads for given estimated power values. In some implementations, PMMs 107, 110 a, 110 b, 118 a, 118 b, 204, and 302 a-302 f may use the determined solution as part of a process to determine utilization and/or other appropriate settings that may cause various electrical loads to consume a predetermined amount of electrical power.

FIG. 5 is another graph 500 of example estimated power consumption and example measured power consumption over time. The graph 500 includes a sequence of measured power consumption values 510 a-510 d. The graph 500 also includes a sequence including a collection of estimated power values 520 a, a collection of estimated power values 520 b, a collection of estimated power values 520 c, and a collection of estimated power values 520 d. In some implementations, the measured values 510 a-510 d, and the values in the collections of estimated values 520 a-520 d may include timestamps of when the different estimations and measurements were taken.

Each of the collections of estimated values 520 a-520 d represents multiple estimated power consumption values taken at various times for a corresponding one of the measured power consumption values 510 a-510 d. In some implementations, estimated power consumption values may be taken multiple times because the estimates may often have higher frequency than then measurements.

In some implementations, a single measured value may be determined for each of the collections of measured values 520 a-520 d. In some implementations, an average value may be determined for each of the collections 520 a-520 d.

In some implementations, each of the collections of estimated values 520 a-520 d may be associated with one of a collection of different times. In the illustrated example, each of the collections of estimated values 520 a-520 d is associated with corresponding measured values 510 a-510 d, taken at different times.

In some implementations, a weighting process may be applied to determine a value to represent each of the collections of estimated values 520 a-520 d. For example, current consumption values that have been estimated at points in time close to when a measured midpoint value has been assigned may be given relatively greater mathematical weight than are values estimated at relatively more distant periods of time away from the measured value. In the illustrated example, a statistical normal curve 530 has been superimposed on the graph 500 and centered on a midpoint 532 when the measured value 510 b was assigned to an electrical load. The curve 530 represents one example type of weighting that may be used to determine the relative influences of the various values included in the collection 520 b (e.g., vertically higher points on the curve 530 represent relatively higher weightings for estimated values than points vertically lower on the curve 530). In the illustrated example, an estimated value 540 a is given a statistically greater weight than an estimated value 540 b because the value 540 a is time-wise closer to the measured value 510 b than the value 540 b.

In some implementations, solutions for estimated power consumption values may be fitted to substantially only the measured power consumption values measured within a predetermined time period. For example, by limiting the fitting process to only consider values measured within a recent window of time, changes in the offset value “b” may be better accommodated (e.g., a cooling fan or light was turned on or off, a network switch was added to or removed from the measured electrical load). In the illustrated example, a time period 550 includes the estimated values 510 b and 510 c and the collections of measured values 520 b and 520 c, and as such, a model may be determined those values while ignoring values from beyond the time period 550 (e.g., values 510 a and 510 d, collections 520 a and 520 d).

FIGS. 6-8 are flow diagrams of example processes for estimating the power consumption of an electrical load. Referring to FIG. 6, an example process 600 starts at step 610. In some implementations, the process 600 may be performed by the PMMs 107, 110 a, 110 b, 118 a, 118 b, 204, and 302 a-302 f of FIGS. 1-3. At step 610, a mathematical function that fits a collection of measured data is sought. At step 620, a region “R” is determined as an entire time offset region. At step 630, the region “R” is split into multiple subregions (e.g., T0 to T1, T1 to T2, . . . ).

At step 640, for each subregion boundary, designated as “Tj”, a fitting process is performed for a collection of measured power consumption values “Mi” and a collection of estimated power consumption values “Ei” determined at the corresponding subregion boundary “Tj” to determine the proportional error value “a” and the offset value “b”. An example fitting process is discussed in the description of FIG. 7.

At step 650, a time “Ti” and the corresponding values of “a” and “b” that produce substantially the least error at time “Ti” are found. At step 660, the region “R” is designated as extending from one time period prior to the time “Ti” to one time period after the time “Ti”.

At step 670, if the size of the region “R” is determined not to be less than a predetermined size “Delta”, then step 630 is repeated. If, however, at step 670 the size of the region “R” is determined to be less than a predetermined size “Delta”, then at step 680 the values for “a”, “b”, and “Ti” are returned.

Referring now to FIG. 7, an example process 700 is illustrated. In some implementations, the process 700 may be performed by the PMMs 107, 110 a, 110 b, 118 a, 118 b, 204, and 302 a-302 f of FIGS. 1-3. In some implementations, the process 700 may be the fitting process performed as part of step 640 of FIG. 6. The process 700 starts with a given collection of measured power consumption values “Mi”, a collection of estimated power consumption values “Ei”, and a time boundary value “Tj”.

At step 710, for each measured value “Mi” and corresponding time “Ti”, all corresponding pairs of “Ej” and “Tj” are found for when “Pj”, which equals the absolute value of “(Tj+D−Ti),” is less than a predetermined threshold value. “Ej” and “Tj” are aggregated to “Ei”, weighted by the mathematical function “1/(Pj+r)”.

At step 720, linear regression is performed on the collection of measured values “Mi” and the collection of expected values “Ei” such that the SUM(f(Ei)−Mi)^2 is substantially minimized where f(Ei)=a*Ei+b. At step 730, the values of “a” and “b” are returned.

Referring now to FIG. 8, an example process 800 is illustrated. In some implementations, the process 800 may be performed by the PMMs 107, 110 a, 110 b, 118 a, 118 b, 204, and 302 a-302 f of FIGS. 1-3. In general, the process 800 describes a technique for comparing estimated power draws and corresponding measured power draws to determine a function that more accurately estimates power draws.

Power is supplied to computers through a power distribution line (step 810). For example, the PMM 110 a can receive “X” amperes through the branch conductor 108 a. The utilization of a sample of computers is monitored (step 820). For example, the server computers 115 in the electrical load 114 may be monitored to determine their utilization (e.g., percent of used computing capacity).

In some implementations, the estimated individual power draw from the computers in an electrical load can be calculated based on the individual utilization of each computer. For example, the electrical load 114 may include fifteen computers drawing an estimated total of 6 A. In such an example, each computer in the electrical load 114 may be drawing a calculated average of 6/15 of an amp, or 0.4 A per computer.

At step 840, the estimated total power draw is calculated at multiple different times, e.g., corresponding to different times at which utilization measurements were collected.

As the utilization is monitored, the actual power draw at the power distribution line is also monitored (step 850). For example, the PMM 204 may use the current measurement module 220 to measure the actual amount of power being passed from the electrical power source 222 to the electrical load 202. Each estimated total power draw value is associated with an actual power draw value based on the time of the measurement, e.g., as discussed for FIG. 5.

A function is fitted to pairs of actual power draw values and estimated power draw values (step 860). In some implementations, the actual power draw values and estimated power draw values are paired time-wise. For example, the measured value 510 b may be paired with the estimated value 540 a because the estimated value 540 a is the estimated value that is the closest, time-wise of the collection of estimated values 520 b to the measured value 510 b. In some implementations, the measured power draw values may be paired with aggregate values that represents collections of time-wise close estimated values. For example, the measured value 510 a may be paired with a mean value of the collection of estimated values 520 a. In some implementations, the PMM 110 a may perform the fitting processes 600 or 700 to determine the fit function represented by the line 430 of FIG. 4. In some implementations, the function can be a linear function. In the example of the line 430, the fit function is determined by a comparison of measured values “M” to estimated values “E”, which generates a proportional error parameter “a” and an offset “b” as applied to the formula “M=aE+b”. In some implementations, calculating the estimated individual power draw can include storing data representing a computer configuration for each of a collection of computers, and the computer configuration can be an input to a function relating utilization to estimated individual power. In some implementations, the computer configuration can include one or more of processor speed, amount of memory or number of disk drives. For example, the PMM 110 a may estimate that the electrical load 114 will draw an estimated 22 A of power at 65% utilization.

In some implementations, calculating the estimated total power draw can include summing the estimated individual power draw of each of the sample of the plurality of computers. In some implementations, the sample can include substantially all of the plurality of computers. For example the electrical load 114 may include twenty computers that are estimated to draw approximately 1.5 A each at 90% utilization. The PMM 110 a may therefore estimate that the electrical load 114 may draw approximately 1.5×20, or 30 A.

At step 870, the function is stored. For example, the PMM 110 a may store the function and then recall and use the function again at a later time, such as to determine an updated estimated power draw for an updated utilization level.

In some implementations, the process 800 may also include steps for receiving a maximum power draw value for the portion of the data center, calculating an estimated current individual power draw value for each of the sample of the plurality of computers based on the utilization, and calculating an estimated current total power draw value from the estimated current individual power draw value of each of the sample of the plurality of computers. In some implementations, the process 800 can also include steps for calculating an adjusted maximum power draw value from the maximum power draw value and the function, and comparing the estimated current total power draw value to the adjusted maximum power draw value. In some implementations, the process 800 can also include steps for calculating an adjusted estimated current total power draw value from the estimated current total power draw value and the function, and comparing the adjusted estimated total power draw value to the maximum power draw value. In some implementations, the process 800 can also include a step for adjusting operation of at least one computer in the plurality of computers based on the comparison. For example, the PMM 110 a may receive an allocation of “X” amperes that can be drawn from the substation 106. To substantially maximize the usage of the allocated power, the PMM 110 a may determine a utilization level for the electrical load 114 that may cause the electrical load 114 to draw an estimated “X” amperes of current.

In some implementations, the adjusting operation may include one or more of adjusting job allocation, adjusting job scheduling or adjusting central processing unit execution frequency. For example, the PMM 110 a may alter the power draw of the electrical load 114 by altering the number of jobs assigned to the electrical load's 114 computers, by altering the scheduling of when computing jobs are to be performed (e.g., load leveling), or by altering the speed of the computers' CPUs.

FIG. 9 is a schematic diagram of an example of a generic computer system 900. The system 900 can be used for the operations described in association with the processes 600-800 according to some implementations. For example, the system 900 may be included in either or all of the PMMs 107, 110 a, 110 b, 118 a, 118 b, 204, 302 a-302 f, and/or the computers 115, 121 a, and 121 b in the electrical loads 114, 120 a, and 120 b.

The system 900 includes a processor 910, a memory 920, a storage device 930, and an input/output device 940. Each of the components 910, 920, 930, and 940 are interconnected using a system bus 950. The processor 910 is capable of processing instructions for execution within the system 900. In one implementation, the processor 910 is a single-threaded processor. In another implementation, the processor 910 is a multi-threaded processor. The processor 910 is capable of processing instructions stored in the memory 920 or on the storage device 930 to display graphical information for a user interface on the input/output device 940.

The memory 920 stores information within the system 900. In one implementation, the memory 920 is a computer-readable medium. In one implementation, the memory 920 is a volatile memory unit. In another implementation, the memory 920 is a non-volatile memory unit.

The storage device 930 is capable of providing mass storage for the system 900. In one implementation, the storage device 930 is a computer-readable medium. In various different implementations, the storage device 930 may be a floppy disk device, a hard disk device, an optical disk device, or a tape device.

The input/output device 940 provides input/output operations for the system 900. In one implementation, the input/output device 940 includes a keyboard and/or pointing device. In another implementation, the input/output device 940 includes a display unit for displaying graphical user interfaces.

The features described can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The apparatus can be implemented in a computer program product tangibly embodied in an information carrier, e.g., in a machine-readable storage device or in a propagated signal, for execution by a programmable processor; and method steps can be performed by a programmable processor executing a program of instructions to perform functions of the described implementations by operating on input data and generating output. The described features can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. A computer program is a set of instructions that can be used, directly or indirectly, in a computer to perform a certain activity or bring about a certain result. A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment.

Suitable processors for the execution of a program of instructions include, by way of example, both general and special purpose microprocessors, and the sole processor or one of multiple processors of any kind of computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. Typical elements of a computer may include a processor for executing instructions and one or more memories for storing instructions and data. Generally, a computer will also include, or be operatively coupled to communicate with, one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits).

To provide for interaction with a user, the features can be implemented on a computer having a display device such as a CRT (cathode ray tube) or LCD (liquid crystal display) monitor for displaying information to the user and a keyboard and a pointing device such as a mouse or a trackball by which the user can provide input to the computer.

The features can be implemented in a computer system that includes a back-end component, such as a data server, or that includes a middleware component, such as an application server or an Internet server, or that includes a front-end component, such as a client computer having a graphical user interface or an Internet browser, or any combination of them. The components of the system can be connected by any form or medium of digital data communication such as a communication network. Examples of communication networks include, e.g., a LAN, a WAN, and the computers and networks forming the Internet.

The computer system can include clients and servers. A client and server are generally remote from each other and typically interact through a network, such as the described one. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.

A number of implementations have been described. Nevertheless, it will be understood that various modifications may be made. For example, advantageous results may be achieved if the steps of the disclosed techniques were performed in a different sequence, if components in the disclosed systems were combined in a different manner, or if the components were replaced or supplemented by other components. Accordingly, other implementations are within the scope of the following claims. 

What is claimed is:
 1. A method of correlating power in a data center, comprising: supplying power to a portion of a data center through a power distribution line, the portion of the data center including a plurality of computers that draw power through a circuit breaker; monitoring CPU utilization of a statistical sample of the plurality of computers, the statistical sample being a statistically significant number of the plurality of computers that is less than all of the plurality of computers; calculating an estimated individual power draw for each of the statistical sample of the plurality of computers based on the utilization; calculating an estimated total power draw for each of a plurality of different times from the estimated individual power draw of each of the statistical sample of the plurality of computers to generate a plurality of estimated total power draw values for the plurality of different times; monitoring actual power draw at the power distribution line by the portion of the data center and generating a plurality of actual power draw values for the plurality of different times; fitting a function to a plurality of pairs of actual power draw values and estimated power draw values, each pair of the plurality of pairs comprising an actual draw value and an estimated draw value for the same time; and storing the function.
 2. The method of claim 1, wherein calculating the estimated individual power draw comprises storing data representing a computer configuration for each of the plurality of computers, and the computer configuration is an input to a function relating utilization to estimated individual power.
 3. The method of claim 2, wherein the computer configuration includes one or more of processor speed, amount of memory or number of disk drives.
 4. The method of claim 1, wherein calculating the estimated total power draw comprises summing the estimated individual power draw of each of the statistical sample of the plurality of computers.
 5. The method of claim 1, wherein the function is a linear function.
 6. The method of claim 1, further comprising: receiving a maximum power draw value for the portion of the data center; calculating an estimated current individual power draw value for each of the statistical sample of the plurality of computers based on the utilization; calculating an estimated current total power draw value from the estimated current individual power draw value of each of the statistical sample of the plurality of computers; at least one of calculating an adjusted maximum power draw value from the maximum power draw value and the function, and comparing the estimated current total power draw value to the adjusted maximum power draw value, or calculating an adjusted estimated current total power draw value from the estimated current total power draw value and the function, and comparing the adjusted estimated total power draw value to the maximum power draw value; and adjusting operation of at least one computer in the plurality of computers based on the comparison.
 7. The method of claim 6, wherein adjusting operation comprises one or more of adjusting job allocation, adjusting job scheduling or adjusting central processing unit execution frequency.
 8. The method of claim 6, comprising calculating the adjusted maximum power draw value from the maximum power draw value and the function, and comparing the estimated current total power draw value to the adjusted maximum power draw value.
 9. The method of claim 8, wherein adjusting operation comprises adjusting a multiplicity of computers in the plurality of computers to cause the estimated current total power draw value to be closer to the adjusted maximum power draw value.
 10. The method of claim 6, comprising calculating the adjusted estimated current total power draw value from the estimated current total power draw value and the function, and comparing the adjusted estimated total power draw value to the maximum power draw value.
 11. The method of claim 10, wherein adjusting operation comprises adjusting a multiplicity of computers in the plurality of computers to cause the adjusted estimated total power draw value to be closer to the maximum power draw value.
 12. The method of claim 1, wherein calculating an estimated individual power draw for a computer of the plurality of computers comprises measuring utilization of the computer at a plurality of utilization measurement times to generate a plurality of utilization values for the computer, there being a greater number of utilization measurement times than different times such that there are a greater number of utilization values for the computer than different times.
 13. The method of claim 12, further comprising associating each of the plurality of utilization values with one of the plurality of different times.
 14. The method of claim 13, wherein associating includes comparing a utilization measurement time for a utilization value with a midpoint between two adjacent different times.
 15. The method of claim 13, wherein there is a plurality of utilization value times for each of plurality of different times.
 16. The method of claim 15, wherein calculating the estimated total power draw for one of the different times comprises weighting each estimated individual power draw in the plurality of estimated power draw values based on a difference between a value estimating time for the estimate value and the different time.
 17. The method of claim 12, further comprising synchronizing time stamps of the different times and the utilization measurement times.
 18. The method of claim 1, wherein calculating an estimated total power draw occurs at a higher frequency than monitoring actual power draw.
 19. The method of claim 1, wherein the plurality of computers comprises a population of computers in a datacenter.
 20. A data center, comprising: a power distribution line; a plurality of computers that draw power from the power distribution line through a circuit breaker, wherein each computer of a statistical sample of the plurality of computers is configured to generate a measurement of CPU utilization of the computer, the sample being a statistically significant number of the plurality of computers that is less than all of the plurality of computers; a processor configured to receive the measurement of CPU utilization from the statistical sample of the plurality of computer, calculate an estimated individual power draw for each of the statistical sample of the plurality of computers based on the utilization, calculate an estimated total power draw for each of a plurality of different times from the estimated individual power draw of each of the statistical sample of the plurality of computers to generate a plurality of estimated total power draw values for the plurality of different times, receive a measurement of an actual power draw at the power distribution line by the portion of the data center and generate a plurality of actual power draw values for the plurality of different times, fit a function to a plurality of pairs of actual power draw values and estimated power draw values, each pair of the plurality of pairs comprising an actual draw value and an estimated draw value for the same time, and store the function.
 21. A non-transitory computer readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising: receiving the measurement of CPU utilization from a statistical sample of a plurality of computers, the statistical sample being a statistically significant number of the plurality of computers that is less than all of the plurality of computers, calculate an estimated individual power draw for each of the statistical sample of the plurality of computers based on the utilization, calculate an estimated total power draw for each of a plurality of different times from the estimated individual power draw of each of the statistical sample of the plurality of computers to generate a plurality of estimated total power draw values for the plurality of different times, receive a measurement of an actual power draw at the power distribution line by the portion of the data center and generate a plurality of actual power draw values for the plurality of different times, fit a function to a plurality of pairs of actual power draw values and estimated power draw values, each pair of the plurality of pairs comprising an actual draw value and an estimated draw value for the same time, and store the function.
 22. The computer readable medium of claim 21, wherein the plurality of computers comprises a population of computers in a datacenter. 